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METHOD AND APPARATUS FOR IDENTIFYING 
SEMANTIC STRUCTURES FROM TEXT 

BACKGROUND OF THE INVENTION 
The present invention relates to natural 
5 language processing. In particular, the present 
invention relates to processing text to identify the 
semantics of the text. 

The goal of natural language processing is 
to decipher the meaning or semantics of a free-form 

10 linguistic input. Some systems attempt to identify 
the semantics of a natural language input by applying 
semantic rules directly to the individual words in 
the input. Since words can have multiple meanings, 
these rules are very complex and difficult to 

15 develop. For example, the word "room" can be a verb 
as in "Bill asked if he could room with Jake" or a 
noun as in "The table is in the living room". To 
determine which sense of the word is being used, the 
rules have to determine what other words are present 

20 in the text. Because a large number of words can be 
used with each sense of a word, a large number .of 
rules are needed. As a result, a direct application 
of semantic rules to words in the text is considered 
to be unworkable in many environments. 

25 To reduce the number of rules that are 

heeded, many systems perform a syntactic analysis to 
identify the parts of speech of the words in the text 
and the syntactic relationships between the words 
before identifying the semantics of the text. The 

30 parts of speech can then be used as conditions in the 
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rules instead of using the words directly. Thus, 
instead of having to list every possible noun in a 
rule, the rule can be written to simply require a 
noun. This normalization greatly reduces the 

5 complexity of the semantic rules. 

However, using a separate syntactic parse 
produces a two-stage evaluation process in which the 
content of the entire sentence must be fully 
considered in each stage. This makes the process 

10 slow. In addition, if the syntactic parse fails due 
to a poorly formed, fragmentary, or erroneous input, 
the semantic process will also fail. 

In addition, the semantic rules have been 
difficult to write because they have been written as 

15 a series of logical comparisons and operations. This 
makes the addition of new semantic structures time 
consuming. 

SUMMARY OF THE INVENTION 

20 A method and apparatus for identifying a 

semantic structure from an input text forms at least 
two candidate semantic structures. A semantic score 
is determined for each candidate semantic structure 
based on the likelihood of the semantic structure. A 

25 syntactic score is also determined for each semantic 
structure based on the position of a word in the text 
and the position in the semantic structure of a 
semantic entity formed from the word. The syntactic 
score and the semantic score are combined to select a 

30 semantic structure for at least a portion of the 



-3- 

text. In many embodiments, the semantic structure is 
built incrementally by building and scoring candidate 
structures for a portion of the text, pruning low 
scoring candidates, and adding additional semantic 
5 elements to the retained candidates. 

BRIEF DESCRIPTION OF THE DRAWINGS 
FIG. 1 is a block diagram of one computing 
environment in which the present invention may be 
10 practiced. 

FIG. 2 is a block diagram of a system for 
identifying semantic structures from an input text. 

FIG. 3 is a flow diagram of a method of 
identifying semantic structures under one embodiment 
15 of the present invention. 

FIG. 4 is an example of a portion of a 
schema that defines semantic types, Markov 
probabilities, and semantic rules under one 
embodiment of the present invention. 
20 FIG. 5 is a diagram showing the creation of 

semantic entities from an input text. 

FIG. 6 is a diagram showing the effects of 
various semantic rules on the semantic entities of 
FIG. 5. 

25 FIG. 7 is a diagram of initial semantic 

structures formed from the first two semantic 
entities of FIG. 5. 

FIG. 8 is a diagram of the retained 
semantic structures with the addition of the 

30 TimeEntity semantic entity. 



-4- 

FIG. 9 is a diagram of the retained 
semantic structures with the addition of the 
ShowFlight entity. 

FIG. 10 is a diagram of the retained 
5 semantic structures with the addition of the Airline 
and Flight entities. 

DETAILED DESCRIPTION OF ILLUSTRATIVE EMBODIMENTS 

FIG. 1 illustrates an example of a suitable 
computing system environment 100 on which the 
10 invention may be implemented. The computing system 
environment 100 is only one example of a suitable 
computing environment and is not intended to suggest 
any limitation as to the scope of use or 
functionality of the invention. Neither should, the 
15 computing environment 100 be interpreted as having 
any dependency or requirement relating to any one or 
combination of components illustrated in the 
exemplary operating environment 100. 

The invention is operational with numerous 
20 other general purpose or special purpose computing 
system environments or configurations. Examples of 
well-known computing systems, environments, and/or 
configurations that may be suitable for use with the 
invention include, but are not limited to, personal 
25 computers, server computers, hand-held or laptop 
devices, multiprocessor systems, microprocessor-based 
systems, set top boxes, programmable consumer 
electronics, network PCs, minicomputers, mainframe 
computers, telephony systems, distributed computing 
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environments that include any of the above systems or 
devices, and the like. 

The invention may be described in the 
general context of computer-executable instructions, 
5 such as program modules, being executed by a 
computer. Generally, program modules include 
routines, programs, objects, components, data 
structures, etc. that perform particular tasks or 
implement particular abstract data types. The 

10 invention is designed to be practiced in distributed 
computing environments where tasks are performed by 
remote processing devices that are linked through a 
communications network. In a distributed computing 
environment, program modules are located in both 

15 local and remote computer storage media including 
memory storage devices. 

With reference to FIG. 1, an exemplary 
system for implementing the invention includes a 
general-purpose computing device in the form of a 

20 computer 110. Components of computer 110 may 

include, but are not limited to, a processing unit 
120, a system memory 130, and a system bus 121 that 
couples various system components including the 
system memory to the processing unit 120. The system 

25 bus 121 may be any of several types of bus structures 
including a memory bus or memory controller, a 
peripheral bus, and a local bus using any of a 
variety of bus architectures. By way of example, and 
not limitation, such architectures include Industry 

30 Standard Architecture (ISA) bus. Micro Channel 
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Architecture (MCA) bus. Enhanced ISA (EISA) bus. 
Video Electronics Standards Association (VESA) local 
bus, and Peripheral Component Interconnect (PCI) bus 
also known as Mezzanine bus. 
5 Computer 110 typically includes a variety 

of computer readable media. Computer readable media 
can be any available media that can be accessed by 
computer 110 and includes both volatile and 
nonvolatile media, removable and non-removable media. 

10 By way of example, and not limitation, computer 
readable media may comprise computer storage media 
and communication media. Computer storage media 
includes both volatile and nonvolatile, removable and 
non-removable media implemented in any method or 

15 technology for storage of information such as 
computer readable instructions, data structures, 
program modules or other data. Computer storage 
media includes, but is not limited to, RAM, ROM, 
EEPROM, flash memory or other memory technology, CD- 

20 ROM, digital versatile disks (DVD) or other optical 
disk storage, magnetic cassettes, magnetic tape, 
magnetic disk storage or other magnetic storage 
devices, or any other medium which can be used to 
store the desired information and which can be 

25 accessed by computer 110. Communication media 

typically embodies computer readable instructions, 
data structures, program modules or other data in a 
modulated data signal such as a carrier wave or other 
transport mechanism and includes any information 

30 delivery media. The term ''modulated data signal" 
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means a signal that has one or more of its 
characteristics set or changed in such a manner as to 
encode information in the signal. By way of example, 
and not limitation, communication media includes 
5 wired media such as a wired network or direct-wired 
connection, and wireless media such as acoustic, RF, 
infrared and other wireless media. Combinations of 
any of the above should also be included within the 
scope of computer readable media. 

10 The system memory 130 includes computer 

storage media in the form of volatile and/or 
nonvolatile memory such as read only memory (ROM) 131 
and random access memory (RAM) 132. A basic 

input/output system 133 (BIOS) , containing the basic 

15 routines that help to transfer information between 
elements within computer 110, such as during start- 
up, is typically stored in ROM 131. RAM 132 
typically contains data and/or program modules that 
are immediately accessible to and/or presently being 

20 operated on by processing unit 120. By way of 
example, and not limitation, FIG. 1 illustrates 
operating system 134, application programs 135, other 
program modules 136, and program data 137. 

The computer 110 may also include other 

25 removable/non-removable volatile /nonvolatile computer 
storage media. By way of example only, FIG. 1 
illustrates a hard disk drive 141 that reads from or 
writes to non-removable, nonvolatile magnetic media, 
a magnetic disk drive 151 that reads from or writes 

30 to a removable, nonvolatile magnetic disk 152, and an 
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optical disk drive 155 that reads from or writes to a 
removable, nonvolatile optical disk 156 such as a CD 
ROM or other optical media. Other removable/non- 
removable, volatile/nonvolatile computer storage 
5 media that can be used in the exemplary operating 
environment include, but are not limited to, magnetic 
tape cassettes, flash memory cards, digital versatile 
disks, digital video tape, solid state RAM, solid 
state ROM, and the like. The hard disk drive 141 is 

10 typically connected to the system bus 121 through a 
non-removable memory interface such as interface 140, 
and magnetic disk drive 151 and optical disk drive 
155 are typically connected to the system bus 121 by 
a removable memory interface, such as interface 150. 

15 The drives and their associated computer 

storage media discussed above and illustrated in FIG. 
1, provide storage of computer readable instructions, 
data structures, program modules and other data for 
the computer 110. In FIG. 1, for example, hard disk 

20 drive 141 is illustrated as storing operating system 
144, application programs 145, other program modules 
14 6, and program data 147. Note that these 

components can either be the same as or different 
from operating system 134, application programs 135, 

25 other program modules 136, and program data 137. 
Operating system 144, application programs 145, other 
program modules 146, and program data 147 are given 
different numbers here to illustrate that, at a 
minimum, they are different copies. 
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A user may enter commands and information 
into the computer 110 through input devices such as a 
keyboard 162, a microphone 163, and a pointing device 
161, such as a mouse, trackball or touch pad. Other 
5 input devices (not shown) may include a joystick, 
game pad, satellite dish, scanner, or the like. 
These and other input devices are often connected to 
the processing unit 120 through a user input 
interface 160 that is coupled to the system bus, but 

10 may be connected by other interface and bus 
structures, such as a parallel port, game port or a 
universal serial bus (USB) . A monitor 191 or other 
type of display device is also connected to the 
system bus 121 via an interface, such as a video 

15 interface 190. In addition to the monitor, computers 
may also include other peripheral output devices such 
as speakers 197 and printer 196, which may be 
connected through an output peripheral interface 195. 

The computer 110 is operated in a networked 

20 environment using logical connections to one or more 
remote computers, such as a remote computer 180. The 
remote computer 180 may be a personal computer, a 
hand-held device, a server, a router, a network PC, a 
peer device or other common network node, and 

25 typically includes many or all of the elements 
described above relative to the computer 110. The 
logical connections depicted in FIG. 1 include a 
local area network (LAN) 171 and a wide area network 
(WAN) 173, but may also include other networks. Such 

30 networking environments are commonplace in offices. 
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enterprise-wide computer networks, intranets and the 
Internet. 

When used in a LAN networking environment, 
the computer 110 is connected to the LAN 171 through 
5 a network interface or adapter 170. When used in a 
WAN networking environment, the computer 110 
typically includes a modem 172 or other means for 
establishing communications over the WAN 173, such as 
the Internet. The modem 172, which may be internal 

10 or external, may be connected to the system bus 121 
via the user input interface 160, or other 
appropriate mechanism. In a networked environment, 
program modules depicted relative to the computer 
110, or portions thereof, may be stored in the remote 

15 memory storage device. By way of example, and not 
limitation, FIG. 1 illustrates remote application 
programs 185 as residing on remote computer 180. It 
will be appreciated that the network connections 
shown are exemplary and other means of establishing a 

20 communications link between the computers may be 
used. 

The present invention provides a method and 
apparatus for efficiently identifying semantic 
structures in an input text even if the input text is 
25 poorly formed or fragmented. A block diagram showing 
the system of the present invention is presented in 
FIG. 2 and a method for identifying a semantic 
structure is shown in the flow diagram of FIG. 3. 

In step 300 of FIG. 3, a schema 200 is 
30 defined that includes semantic types and 
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probabilities 202, Markov probabilities 204, and 
semantic rules 206. An example of a portion of a 
schema is shown in FIG. 4. 

In FIG . 4 , the schema is defined using an 
5 XML format that begins with <Schema> tag 400 and ends 
with </Schema> tag 402. A semantic type is defined 
within <Entity> tags such as <Entity> tags 404 and 
406. Within the opening <Entity> tag, the identity 
of the entity is listed as the value for the "Type" 

10 attribute. For instance, for <Entity> tag 400, the 
entity is identified as "Flight". The opening 
<Entity> tag also contains a "seen" attribute, which 
indicates the number of times this entity occurred in 
a set of training data. For example, the "Flight" 

15 entity was seen one-hundred twenty-one times in the 
training data. 

Between the opening and closing <Entity> 
tags, a set of <With> tags can be found such as 
<With> tags 408, 410, 412, 414, and 416. Each <With> 

20 tag includes an attribute of "EntityType" that 
identifies an entity that can appear as a child 
entity for the entity listed in the opening <Entity> 
tag. For example, <With> tag 408 indicates that a 
"TimeEntity" can appear as a child entity to the 

25 "Flight" entity found in tag 404. Each <With> tag 
also includes a "Seen" attribute that indicates the 
number of times the entity of the <With> tag was 
found as a child entity to the entity in the <Entity> 
tag. For example, tag 414 indicates that the 
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"Airline" entity was found as a child entity of 
"Flight" thirty-nine times. 

Some entities in a <With> tag are further 
defined by a restriction set in a "Type" attribute. 
5 For example^ <With> tags 410 and 412 are both for an 
entity of "City". However, tag 410 is restricted to 
a city that is a "goal" or destination for a trip, 
while tag 412 is restricted to a city that is a 
"source" or origin of a trip. 

10 Another tag within the <Entity> tag is the 

<Denoted> tag. A word or phrase between two 
<Denoted> tags is a word or phrase that appeared in 
the training data and was associated with this 
entity. For example, the word "flights" was found in 

15 the training text and was associated with the 
"flight" entity as indicated by <Denoted> tag 418. A 
<Denoted> tag can include a "seen" attribute that 
indicates the number of times the word or phrase 
between the <Denoted> tags was associated with this 

20 entity. For example, the word "flights" was 

associated with the "flight" entity seventy-nine 
times. 

Returning to FIG. 3, after the schema has 
been defined, it used in step 302 by a table 
25 construction unit 208 to construct a semantic 
structure table 210. Specifically, the semantic 
types 202 that are defined between <Entity> tags in 
the schema are used to construct semantic structure 
table 210. 
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Structure table 210 consists of a separate 
entry for each possible pair of entities in the 
schema • Each entry includes a listing of all of the 
structures that can be built in which both entities 
5 will be present. Note that these structures include 
structures where the two entities are directly linked 
together and structures where the two entities are 
connected through another entity. For example, one 
entity may be the grand-child of another entity such 

10 that there is an entity between them or the two 
entities may share a common ancestor entity. Only 
those entities that are required to form the 
connection between the two entities are listed for 
any one structure in the table. By creating this 

15 table initially, the present invention reduces the 
number of computations that must be done at run time 
to identify the different ways in which two entities 
may be connected together under the schema. 

At step 304, a text 212 is received by a 

20 semantic entity identifier 214, which identifies all 
of the entities that could be represented by the text 
at step 306. To do this, entity identifier utilizes 
the <Denoted> tags in the <entity> tags of schema 
200. If a word or phrase from the text is found in a 

25 pair of <Denoted> tags, a semantic node is created 
for the word or phrase and the entity that included 
the <Denoted> tag is added as a possible entity for 
the node. Note that a single node may have multiple 
entities. Also, if the span of two entities (the 

30 words or phrase associated with the entities) 
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partially overlap, they are considered to be part of 
the same semantic node. The result is a semantic 
entity lattice. 

FIG. 5 provides a diagram showing an 
5 example entity lattice formed through step 306. In 
FIG. 5^ the poorly formed input sentence "I am in 
Boston and I want to be in Detroit on Tuesday, show 
me United ^flights" is processed to identify the 
entity lattice. During this process, "Boston" and 

10 "Detroit" are mapped to two separate "cityname" 
entities 500 and 502, which represent two sequential 
semantic nodes. "Tuesday" is mapped to a 

"daysofweek" entity 504 at a third semantic node. 
The word "show" is mapped to four possible entities 

15 that overlap at the fourth node. Specifically, 
"show" is mapped to "ShowAirlineServed" entity 506, 
"ShowFlight" entity 508, "ListTransport" entity 510, 
and "ShowCapacity" entity 512. The words "United" 
and "flights" are mapped to "Airline" entity 514 and 

20 "Flight" entity 516, respectively, which represent 
the last two semantic nodes in the lattice. 

At step 308, semantic rules 206 are applied 
to the entities identified in step 306. Examples of 
such semantic rules are shown in the example schema 

25 of FIG. 4 between <GrammarRule> tags such as 
<GrammarRule> tags 420 and 422 and <GrammarRule> tags 
424 and 426. 

Between each pair of <GrammarRule> tags are 
one or more <Constit> tags such as <Constit> tag 428. 

30 Each <Constit> tag represents an entity that is 
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identified within the <Constit> tag by an "Entity" 
attribute. To determine if a particular grammar rule 
should execute, the entity lattice formed in step 306 
is examined to see if all of the entities listed 
5 within the <Constit> tags of the rule are present in 
separate semantic nodes along the lattice. Under 
some embodiments^ the entities must appear in the 
lattice in the same order as the <Constit> tags. For 
example, if a grammar rule contains the following 

10 sequence of tags: 

<Constit Number="0" Entity="Number" /> 
<Constit Number="l" Entity=" Flight "> 
then a "Number" entity must appear to the left of a 
"Flight" entity in the entity lattice in order for 

15 the grammar rule to execute. 

As shown above, the <Constit> tags also 
include a "Number" attribute. This attribute assigns 
a value to the entity that can be used to identify 
the entity during execution of the rule . This value 

20 is needed since two entities of the same entity type 
can be used in a single grammar rule. 

In general, each grammar rule combines the 
constituent entities found in the <Constit> tags to 
form a new entity in which the constituent entities 

25 are embedded. One type of grammar rule forms a 
parent entity where each constituent entity becomes a 
child to the parent entity. Such rules utilize a 
<Parent> tag with an "Entity" attribute that 
identifies the entity to be used as the parent. When 

30 this tag is encountered and the entities of the 
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<Constit> tag are present in the entity lattice, the 
individual entities of the <Constit> tags are removed 
and a semantic structure is generated in which the 
entities of the <Constit> tag become child entities 
5 to the entity found in the <Parent> tag. The span of 
this structure is then the combination of the span of 
the child entities. 

The grammar rules may include more 
complicated structure formations in which the 

10 entities of the <Constit> tags are combined through 
multiple level structures or the attributes of an 
entity are changed or an entity itself is changed. 
Such rules include a <Rule> tag that designates the 
final structure to be formed from the constituent 

15 entities using the values assigned through the 
"number" attribute of the <Constit> tags. For 
example: 

<Constit Number="0" Entity="City" 

Res triction=" Source" /> 
20 <Constit Number="l" Entity="City" 

Restriction="Goal" /> 
<Constit Number="2" Entity="City" 

Restriction="Goal" /> 
<Rule>{ Flight [ (Source, City, 0) (Via, City, 1) 
25 (Goal, City, 2) ] }</Rule> 

creates a "Flight" structure with a parent "Flight" 
entity, a child source city entity that is filled 
with the city entity designated as entity "0", a 
child via city entity that is filled with the city 
30 entity designated as entity "1", and a child goal 
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city entity that is filed with the city entity 
designated as entity "2". Note that in this rule, 
the city entity designated as entity "1" is changed 
from a "goal" entity to a "via" entity. 
5 In the example above, the "Restriction" 

attribute is a special type of entity. Like other 
entities, a restriction entity includes <Denoted> 
tags in its definition that indicate which words will 
generate the entity. For example, a "Goal" 

10 restriction entity is denoted by the word "to" or the 
phrase "arriving at". Restriction entities differ 
from other entities in that they modify another 
entity on the same level in the structure. Thus, 
Goal, Source, and Via modify a City entity. 

15 Together, the restriction entity and the entity being 
modified form a single entity with a restriction. In 
the discussion below, the restriction entity is often 
referred to as just a restriction. 

In the schema, if an entity accepts another 

20 entity with a restriction, the restriction is 
identified using the "Type" attribute in the <With> 
tag. For example, <With> tag 416 indicates that the 
"Flight" entity accepts a "City" entity with a "Via" 
restriction. 

25 FIG. 6 shows the results of performing the 

grammar rules found in FIG. 4 on the entity lattice 
of FIG. 5. Specifically, the grammar rule found 
within <GrammarRule> tags 424 and 426 has been 
applied to "CityName" entities 500 and 502. This 

30 produces "City" semantic structures 600 and 602 with 
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parent "City" entities 604 and 606 and child 
"CityName" entities 500 and 502. The grammar rule 
within <GrammarRule> tags 420 and 422 has been 
applied to "DaysofWeek" entity 504 to produce a 
5 "TimeEntity" structure 608 with a "TimeEntity" parent 
node 610 and a "DaysofWeek" child entity 504. 

After the semantic rules have executed, the 
first two semantic nodes are selected at step 310. 
Under most embodiments, this involves selecting the 

10 two left-most semantic nodes. 

At step 312, a structure generator 216 uses 
semantic structure table 210 to identify all of the 
possible structures that can be formed using the 
entities listed for the two selected semantic nodes. 

15 For example, if the first node contains entities A 
and B and the second node contains entities C and D, 
separate structures are identified for the entity 
pairs AC, AD, BC and BD. Note that multiple 
structures may be identified for each entity pair. 

20 This process can be performed quickly at run time 
because the possible structures for each entity pair 
have been previously identified and stored in 
structure table 210. 

Examples of structures generated from the 

25 first two entities of the modified entity lattice of 
FIG. 6 are shown in FIG. 7. Since there is only one 
entity for each of the first two semantic nodes in 
FIG. 6, there is only one entity pair to consider. 
However, the structure generator has identified six 

30 separate structures that can be formed with this 



-19- 

single entity pair. Specifically, six "Flight" 
semantic structures 700, 702, 704, 706, 708, and 710 
are identified. 

Although all of the identified semantic 
5 structures are "Flight" structures, they each use 
"City" semantic structure 600 (consisting of "City" 
entity 604 and "CityName=Boston" entity 500) and 
"City" semantic structure 602 (consisting of "City" 
entity 606 and "CityName=Detroit" entity 502) to fill 

10 different slots in the "Flight" structure. 
Specifically, each "City" semantic structure can be 
used to fill one of three slots for the restricted 
entities "Source City", "Via City", and "Goal City" 
found in the "Flight" entity. The six "Flight" 

15 structures thus represent the six different possible 
assignment combinations for the two "City" semantic 
structures . 

For example, in "Flight" structure 700, 
"City" entity 604 fills a "Source" city slot 720 and 

20 "City" entity 606 fills a "Goal" city slot 722. 
However, in "Flight" structure 702, "City" entity 604 
fills a "Goal" city slot 724 and "City" entity 606 
fills a "Source" city slot 726. 

In "Flight" structure 704, "City" entity 

25 604 fills a "Source" city slot 728 and "City" entity 
606 fills a "Via" city slot 730 while in "Flight" 
structure 706, "City" entity 604 fills a "Via" city 
slot 732 and "City" entity 606 fills a "Source" city 
slot 734. 
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Lastly, in "Flight" structure 708, "City" 
entity 604 fills a "Goal" city slot 736 and "City" 
entity 606 fills a "Via" city slot 738 while in 
"Flight" structure 710, "City" entity 604 fills a 
5 "Via" city slot 740 and "City" entity 606 fills a 
"Goal" city slot 742. 

In the semantic structures of FIG. 1, the 
"Flight" entity is a parent entity to each of the 
city entities in that each city entity is listed in a 

10 <With> tag within the definition of the "Flight" 
entity. Thus, each city entity is a child entity 
that is considered to be beneath the "Flight" entity 
in the semantic structure. 

After all of the possible structures for 

15 the two nodes have been created, the structures are 
provided to a structure scoring module 218, which 
scores each structure at step 314. Under the present 
invention, the score for each structure is the 
product of three semi-orthogonal scores. One of 

20 these scores is based on the likelihood of the 
semantic structure, another of the scores is based on 
the order in which semantic entities were expressed 
in the input text, and the last score is based on the 
syntactic correctness of the semantic structure. 

25 The first score is referred to as a 

semantic score and is calculated as: 

Semscore = f][ p(Ej ) Y[ Pic^ \Ej) Eq . 1 
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where Ej are the separate entities in the entity 
structure S, is all of the children entities of 

entity Ej , p{Ej) is the probability of entity Ej and 
p{c-\Ej) is the probability of child entity c,. given 
5 entity Ej . 

The probabilities in the semantic 
probability are determined from the schema using the 
"seen" attributes in the <Entity> tags and the <With> 
tags. Specifically, the probability of an entity 
10 p{Ej) is determined by dividing the value of the 

"Seen" attribute in the <Entity> tag for that entity 
by the sum of the values for "Seen" attributes in the 
<Entity> tags of all of the entities in the schema. 
The probability of a child entity given the parent 
15 entity p{Ci\Ej) is determined by dividing the value of 

the "Seen" attribute in the <With> tag for the child 
entity by the value of the "Seen" attribute in the 
<Entity> tag of the parent entity. 

In some embodiments of the present 

20 invention, the semantic score of Equation 1 is 
augmented with a penalty factor that is applied to 
reduce the probability of an entity if the entity was 
not derived directly from the text but instead was 
created to connect two entities that were derived 

25 from the text. For example, in FIG. 7, the 
probability of "Flight" entity 750 would be reduced 
by a penalty factor since the "Flight" entity was 
placed in the semantic structure to connect the two 
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"city" entities and was not taken directly from the 
entity lattice produced from the text. In some 
embodiments, the penalty factor is .90. 

The second score, which is based on the 
5 order in which the semantic entities are expressed in 
the input text, is referred to as a Markov score or a 
transition score. Under one embodiment, the Markov 
score is calculated as the product of transition 
probabilities for each pair of entities that are at 

10 the same level in the semantic structure. For 
example, the Markov probability for semantic 
structure 700 of FIG. 7 would be calculated as simply 
the transition probability for having a source city 
followed by a goal city. If there were another child 

15 node on the same level after the goal city, an 
additional transition probability would be determined 
for having a goal city followed by the additional 
entity. 

The transition probability for a pair of 
20 same-level entities is determined from 

<TransitionProb> tags (denoted as Markov 

Probabilities 204 in FIG. 2) within schema 200. 
Examples of such <TransitionProb> tags are found in 
FIG. 4 as <TransitinProb> tags 430 and 432. 
25 Within the <TransitionProb> tags the order 

of the entities is denoted using the "Rl", "El", 
"R2", and "E2" attributes. The "Rl" and "El" 
attributes identify the restriction and the entity, 
respectively, for the left-most entity of the two 
30 entities and the "R2" and "E2" attributes identify 
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the restriction and the entity for the right-most 
entity of the two entities . Thus, the transition 
probability in <Transitionprob> tag 430 is for a Goal 
City followed by a Source City and the transition 
5 probability in <Transitionprob> tag 432 is for a 
Source City followed by a Goal City. Each 
<Transitionprob> tag also includes a "Seen" 
attribute, which indicates the number of times the 
two entities were found together in the order listed 

10 in the tag. • 

To determine a probability for a specific 
order of two entities, all of the <Transitionprob> 
tags that contain the two entities are located. 
(There will either be one or two such tags) . The 

15 probability of a specific ordering of the two 
entities is determined by dividing the "Seen" value 
for that ordering by the sum of the "Seen" values for 
all of the <Transitionprob> tags that include the two 
entities. For example the probability of a Goal City 

20 followed by a Source City in FIG. 4 is calculated as 
2/71 while the probability of a Source City followed 
by a Goal City is calculated as 69/71. 

The product of the individual transition 
probabilities for each pair of same-level entities is 

25 then determined to form the Markov probability for 
the structure. Note that although only two-entity 
transition probabilities are used above, any number 
of entities may be used to determine the transition 
probabilities . 
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The last probability, which indicates the 
syntactic correctness of the semantic structure, is 
referred to as the syntactic score . This score can 
be generated in a number of different ways. Under 
5 one embodiment, a syntactic parse structure is formed 
based on the semantic structure. Specifically the 
words spanned by the entities in the semantic 
structure are placed in a syntactic structure in the 
same relative position as the entities in the 

10 semantic structure. A score for the resulting 
syntactic parse tree is then determined and used as 
the syntactic score for the semantic structure. 

Under another embodiment of the present 
invention, the syntactic score is calculated by 

15 determining a separate score for each parent-child 
relationship in the semantic structure. For each 
parent-child relationship, the token position of the 
word or phrase that is spanned by the parent entity 
and the token position of the word or phrase that is 

20 spanned by the child entity are identified, where the 
token position is the number of tokens from the 
beginning of the sentence to the word or phrase. A 
token includes individual words or a phrase that 
operates as a single entity. For example. New York 

25 City is a single token. 

The token positions are then used to 
identify a probability that describes the likelihood 
of a parent entity being at token position X when a 
child entity is at token position Y. Thus, the 

30 syntactic score is calculated as: 



Synscore = Y\ p{^^ Eq . 2 

where is the set of all parent entities, Cy is the 
set of child entities for each parent entity, X is 
the token position of the parent entity and Y is the 
token position of the child entity. 

The product of the semantic score, the 
Markov score, and the syntactic score is the full 
score for each semantic structure. By integrating 
these three semi-orthogonal scores, it is possible to 
identify semantic structures from even poorly formed 
text because although the candidate structures 
generated from such poorly formed sentences may have 
low syntactic scores, they will still be scored 
semantically and the semantic scores may be high 
enough for at least one of the structures to make it 
a viable semantic structure. In addition, by 

integrating a syntactic component and a semantic 
component of the score during the progressive 
building of the semantic structure, the present 
invention greatly reduces the amount of time needed 
to identify a semantic structure from the text. 

After the semantic structures have been 
scored at step 314, one or more of the structures is 
pruned from consideration at step 316 by a pruning 
system 220. Such pruning can be performed by taking 
the top N semantic structures or by taking those 
semantic structures within a threshold of the top 
scoring semantic structure. For the example of FIG. 
7, the top four semantic structures are retained. 
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Specif ically, structures 700, 704, 708, and 710 are 
retained. 

After pruning, the lattice is examined at 
step 318 to determine if there are any other semantic 
5 nodes to process. If there are additional semantic 
nodes to process, the entities at the next node are 
combined with the retained semantic structures by 
structure generator 216 to form all possible 
combinations of the entities and the retained 

10 structures at step 320. 

Under one embodiment, an ordered search is 
performed to determine how an entity can be attached 
to a semantic structure. First, a search is 
performed to find structures in which the new entity 

15 fits below any of the entities in the semantic 
structure. Next, a search is performed to find 
structures in which the head of the semantic 
structure fits beneath the new entity. Lastly, a 
search is performed to see if there is a common 

20 ancestor between the head of the semantic structure 
and the new entity. Each of these searches is 
performed using semantic structure table 210. 

Examples of the new structures formed from 
the structures of FIG. 7 and the "TimeEntity" entity 

25 are shown in FIG. 8. Specifically, structures 800, 
802, 804, and 806 have been formed from structures 
700, 704, 708, and 710, respectively. In all of the 
structures, the "TimeEntity" entity has been added as 
a child node to the "Flight" entity. 
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Steps 314 and 316 are repeated to score 
each of the semantic structures and to prune the set 
of structures. Since there are only four structures 
in the example of FIG, 8, none of the structures are 
5 pruned. 

Steps 318 and 320 are then performed to 
combine the structures of FIG. 8 with 
"ShowAirlineServed" entity 506, "ShowFlight" entity 
508, "ListTransport" entity 510, and "ShowCapacity" 

10 entity 512 of the next semantic node. 

Of the four entities at the next semantic 
node, only the "ShowFlight" entity can be combined 
with the structures of FIG. 8. In addition, the 
"ShowFlight" entity can only be combined to each of 

15 the structures such that the "ShowFlight" entity 
becomes the head of the structure with the "Flight" 
entity as its immediate child. This results in 
semantic structures 900, 902, 904, and 906 of FIG. 9, 
which correspond to structures 800, 802, 804, and 806 

20 of FIG. 8. 

Scoring and pruning steps 314 and 316 are 
repeated for new semantic structures 900, 902, 904, 
and 906 before determining that there are more 
semantic nodes to process at step 318. The "Airline" 

25 entity is then added to semantic structures 900, 902, 
904, and 906 at step 320. For each of these 
structures, this involves adding the "Airline" entity 
as a child node to the "Flight" entity. The 
resulting structures are shown as semantic structures 

30 1000, 1002, 1004, and 1006 in FIG. 10. 
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Steps 314, 316 and 318 are then repeated 
for the new structures before adding the "Flight" 
entity to the structures of FIG. 10. Since a 
"Flight" entity was created to connect the two city 
5 entities, a "Flight" entity does not need to be added 
to the structures. However, the fact that a "Flight" 
entity was found in the lattice means that the scores 
for the semantic structures no longer include a 
penalty factor for the creation of the "Flight" 
10 entity. 

After scoring and pruning the new 
structures, there are no more semantic nodes to 
process at step 318 and the process of FIG. 3 
continues at step 322 where the highest-scoring 

15 structure is examined by a command identifier 222 to 
determine if it is a command structure. A command 
structure will be headed by an entity with a 
"Command" attribute that has been set to "True" in 
schema 200. If the highest-scoring semantic 

20 structure is a command structure, the structure is 
returned at step 324 as the command represented by 
the text. 

If the highest-scoring semantic structure 
is not a command entity at step 322, command 
25 identifier 222 attempts to identify commands that can 
accept the highest-scoring structure at step 326. 
The highest-scoring structure is then incorporated 
into each of these command structures and the command 
structures are scored at step 328. The highest- 
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scoring command structure is then returned as the 
command represented by the text. 

Although the present invention has been 
described with reference to particular embodiments, 
5 workers skilled in the art will recognize that 
changes may be made in form and detail without 
departing from the spirit and scope of the invention. 
In particular, the scoring functions may be modified 
or combined within the scope of the present 
10 invention. 



